Hierarchical clustering to split links that are too large
Hierarchical clustering to split links that are too large
You can use words, links, and other information that appears on the page as information for clustering.
What is alike is near and what is apart is far
If you set an appropriate threshold, that tag will be split into several groups.
---
This page is auto-translated from /nishio/階層的クラスタリングによる大きすぎるリンクの分割. If you looks something interesting but the auto-translated English is not good enough to understand it, feel free to let me know at @nishio_en. I'm very happy to spread my thought to non-Japanese readers.